Methods and systems for fast upgrade or reboot for network device

ABSTRACT

Embodiments of the present disclosure are directed to protocol state transition and/or resource state transition tracker configured to monitor, e.g., via filters, for certain protocol state transitions/changes or host hardware resource transitions/changes when a host processor in the control plane that performs such monitoring functions is unavailable or overloaded. The filters, in some embodiments, are pre-computed/computed by the host processor and transmitted to the protocol state transition and/or resource state transition tracker. The protocol state transition and/or resource state transition tracker may be used to implement a fast upgrade operation as well as load sharing and or load balancing operation with control plane associated components.

CROSS REFERENCE TO RELATED APPLICATION

This is a continuation application of U.S. patent application Ser. No.16/748,256, filed Jan. 21, 2020, entitled “METHODS AND SYSTEMS TO TRACKPROTOCOL AND HARDWARE RESOURCE STATE TRANSITIONS,” which is incorporatedby reference herein in its entirety.

TECHNICAL FIELD

Embodiments of the present invention relate to networking equipment, inparticular, hardware and software architecture and components that trackand update data-plane protocol transitions or hardware state transitionsin networking equipment.

BACKGROUND

Modern networking devices such as switches are configured with a dataplane (also referred to as the forwarding plane), a control plane, and amanagement plane.

The data or forwarding plane comprises an amalgamation of hardware andsoftware components that are optimized for speed of processing, and forsimplicity and regularity that are responsible for the forwarding ofpackets through the networking device. The data plane relies on routingand/or forwarding tables that are maintained in high-speed, oftencustomized, memory of the data plane. In most implementations, dataplane components typically include route or network processors thatinterfaces with application-specific integrated circuits (ASICs) and thehigh-speed memory across dedicated data buses or switch fabrics.

The control plane operates with the data plane and is primarilyresponsible for the populating and updating of the routing or forwardingtables, among other things. Control plane hardware components aretypically optimized for customizability, handling policies, handlingexceptions and are often implemented via microprocessor(s) (oftenreferred to as a host processor(s)) that implements instructions storedin local memory.

BRIEF DESCRIPTION OF THE DRAWINGS

The embodiments herein may be better understood by referring to thefollowing description in conjunction with the accompanying drawings inwhich like reference numerals indicate identically or functionallysimilar elements, of which:

FIG. 1A is a diagram of an exemplary network device configured with aprotocol state transition and/or resource state transition trackingmodule in accordance with an illustrative embodiment.

FIG. 1B is a diagram of another exemplary network device configured witha protocol state transition and/or resource state transition trackingmodule in accordance with an illustrative embodiment.

FIG. 2A shows an exemplary network device configured with a protocolstate transition and/or resource state transition tracker module inaccordance with an illustrative embodiment.

FIG. 2B shows an exemplary network device configured with a protocolstate transition and/or resource state transition tracker module inaccordance with another illustrative embodiment.

FIG. 2C shows an exemplary network device configured with a protocolstate transition and/or resource state transition tracker module inaccordance with another illustrative embodiment.

FIG. 3 shows an exemplary network device configured with the protocolstate transition and/or resource state transition tracker module of FIG.2A, 2B, or 2C in accordance with an illustrative embodiment.

FIG. 4 shows an exemplary network device configured to perform updatesto data plane resources during a software upgrade operation, e.g., asdescribed in relation to FIG. 3 , in accordance with an illustrativeembodiment.

FIG. 5A shows an exemplary method of tracking protocol state and/orresource state transitions of the control-plane (e.g., during theunavailable, overloaded state of the control-plane, or as a normalcourse of operation in parallel to the host CPU) in accordance with anillustrative embodiment.

FIG. 5B shows an exemplary method of tracking protocol state and/orresource state transitions of the control-plane (e.g., during theunavailable, overloaded state of the control-plane, or as a normalcourse of operation in parallel to the host CPU) in accordance withanother illustrative embodiment.

FIG. 6 shows an exemplary timing diagram of a method of executing fastupgrade operations in a network device configured with an exemplaryprotocol state transition and/or resource state transition trackermodule in accordance with an illustrative embodiment.

FIG. 7 shows an exemplary timing diagram of another method of executingfast upgrade operations in a network device configured with an exemplaryprotocol state transition and/or resource state transition trackermodule in accordance with an illustrative embodiment.

FIG. 8 shows an exemplary timing diagram of a method of executing loadbalancing and/or load sharing operations in a network device configuredwith an exemplary protocol state transition and/or resource statetransition tracker module in accordance with an illustrative embodiment.

FIG. 9 show an exemplary protocol state transition filter configured toexecute on the protocol state transition and/or resource statetransition tracking module and corresponding actions sequence associatedwith a matched instance of the filter in accordance with an illustrativeembodiment.

FIG. 10 show an exemplary hardware resource state transition filterconfigured to execute on the protocol state transition and/or resourcestate transition tracking module and corresponding actions sequenceassociated with a matched instance of the filter in accordance with anillustrative embodiment.

FIG. 11 shows a timing diagram for an example baseline software upgradeoperation for a switch network device in which a protocol statetransition and/or resource state transition tracker module.

FIG. 12 shows a timing diagram for an example fast-software upgradeoperation for a switch network device in which the network device isconfigured with a protocol state transition and/or resource statetransition tracker module in accordance with an illustrative embodiment.

FIG. 13 shows a timing diagram for another example fast-software upgradeoperation for a switch network device in which the network device isconfigured with a protocol state transition and/or resource statetransition tracker module in accordance with another illustrativeembodiment.

DESCRIPTION OF THE EXAMPLE EMBODIMENTS Overview

In an aspect, an embodiment of the present disclosure is directed to aprotocol state transition and/or resource state transition trackerconfigured to monitor, e.g., via filters, for certain protocol statetransitions/changes or host hardware resource transitions/changes when ahost processor (also referred to herein as a “host CPU”) in the controlplane that performs such monitoring functions is unavailable oroverloaded. The filters, in some embodiments, are pre-computed, prior tothe host processor becoming unavailable, by the host processor andtransmitted to the protocol state transition and/or resource statetransition tracker, e.g., executed in a data plane component, when thehost processor is unavailable or overloaded.

Subsequently, appropriate routing or forwarding tables of the data planeare updated for a given detected transition. In some embodiments, theexemplary protocol state transition or resource state transition trackerstores the detected transition to be later updated by the host processorwhen the host processor becomes available. In other embodiments, thehost processor off-loads the tracking and/or updating of certainprotocol state transition changes or host hardware resource transitionchanges to the exemplary protocol state transition or resource statetransition tracker, thus freeing resources of the host processor withrespect to such protocol state transition changes or host hardwareresource transition changes.

In some embodiments, the exemplary protocol state transition and/orresource state transition tracker is used to monitor for certainprotocol state transition changes or host hardware resource changesduring a bootup operation or a software upgrade operation that makes thehost processor unavailable. The exemplary protocol state-machine orresource tracker can thus act as a proxy for the host processor inkeeping certain routing and forwarding tables synchronized to variousprotocol states of the network. Because the time to create data planeresources (e.g., MAC learning tables, RIB tables, ACL tables, etc.) forforwarding processes/applications can be in the order of minutes, anupgrade of the operating system or application(s) execution on the hostprocessor and the subsequent booting of the host processor and buildingof such data plane resources may disrupt network operations for suchtime period. Indeed, the exemplary protocol state transition and/orresource state transition tracker may facilitate near-instantaneousupgrade operation of switching network devices, e.g., when operating inconcert with available fast upgrade technology while providing shorteroverall system down-time as compared to use of available fast upgradetechnology by themselves, as well as to improve system resourceutilization (e.g., in load sharing operation or in load balancingoperation) and operation by acting as a proxy for the host processor inupdating certain data plane resources. As used herein, “load sharing”refers to the off-loading of certain control plane functions from thehost processor to the protocol state transition and/or resource statetransition tracker; thus, the load on the host processor is shared. And,“load balancing” refers to the protocol state transition and/or resourcestate transition tracker taking on parts of the control plane load ofthe host processor when the host processor is overloaded.

With respect to fast upgrades, although upgrades are available forapplications and operating system executing on the host CPU, because ofthe disruption to the network, such upgrades are often deferred untilmore substantial upgrades are required or scheduled. To this end,security and bug fixes may persist for a longer duration on a givennetwork equipment. Further, in some operating environments, e.g.real-time controls in factory automation and such, disruption of networkconnectivity for a minute or more may cause the entire operation line toreset. Reducing disruption time during minor upgrades to a few secondscan avoid such disruptions and thus may increase the frequency thatupgrades are performed, thereby improving overall system health andsecurity.

The term “data-plane processor” (and “data plane devices”) as usedherein, generally refers to a processing unit involved in switchingand/or routing of packets in the network device as part of thedata-plane. Data-plane processors may include network processors (NPUs),route processors (RPs), switching-ASICs (application-specific integratedcircuit), switching FPGA (field-programmable gate array), CPLD (complexprogrammable logic device), and the like. Data-plane processors are partof the data-plane, which further includes data-plane resourcesoperatively coupled to, or are part of, the data-plane processors.Examples of data plane resources may include, but are not limited to,MAC address table(s), FIB table(s), RIB table(s), ACL table(s), and anyother tables, register contents, content address memory (CAM) contents,tertiary content address memory (TCAM) contents, binarycontent-addressable memory (BCAM) contents, and memory contents (e.g.,non-persistent, volatile, etc.) maintained or used by data-planeprocessors.

The term “host processor”, as used herein, is used interchangeably withthe term “host CPU” and generally refers to cores of a microprocessor ormicrocontroller, e.g., having RISC or CISC architecture, that areconfigured to execute computer instructions within the framework of anoperating system in a networking device.

In an aspect, a network device (e.g. switch) is presented comprising ahost CPU executing instructions for control-plane operations that manageand maintain a plurality of data-plane-associated tables (e.g., L2 MACtable; MAC learning tables; L3 tables; RIB, FIB, etc.) of aswitch-fabric of the network device, the instructions when executed bythe host CPU further computes a plurality of filters to identifyprotocol state and/or resource state transitions; and a processor unitor logic circuit (i.e., a non-host CPU component, e.g., logic circuitsof NPU, RP, ASIC, switching-FPGA, or a core located therein, remotedevice) configured to receive the plurality of filters computed by thehost CPU; and track, via the plurality of filters, protocol state and/orresource state transitions of the control-plane (e.g., during anunavailable, overloaded state of the control-plane or as a normal courseof operation in parallel to the host CPU), wherein the tracked protocolstate and/or resources are used, by the host CPU or the processor unitor logic circuit, to update the plurality of data-plane associatedtables.

In some embodiments, the tracked protocol state and/or resources areused by the processor unit or logic circuit to update the data-planewhen the host CPU is in the unavailable or overloaded state.

In some embodiments, the tracked protocol state and/or resources areused by the host CPU to update the data-plane of a detected protocolstate and/or resources when the host CPU transitions from theunavailable or overloaded state to an available state.

In some embodiments, the tracked protocol state and/or resources areused by the processor unit or logic circuit to update the data-plane inparallel with host CPU operations.

In some embodiments, the network device comprises a data-plane device(e.g., NPU, switching ASIC) that uses said plurality ofdata-plane-associated tables to route packets received at network portsof the network device to other network ports of the network device,wherein the processor unit or logic circuit is implemented in the dataplane device.

In some embodiments, the processor unit or logic circuit is implementedin a remote device external to the data plane.

In some embodiments, the data-plane implements a filter to monitor for aspecific protocol state transition in a received packet during theunavailable state of the host CPU and/or a specific resource statetransition during the unavailable state of the host CPU.

In some embodiments, the plurality of filters are pre-computed by thehost CPU prior to the host CPU entering into the unavailable oroverloaded state.

In some embodiments, the processor unit or logic circuit are implementedin a packet classification engine, a packet-inspection engine,deep-packet inspection engine, an embedded micro-controller inData-plane, and/or ACL TCAMs located within a component of thedata-plane.

In some embodiments, the processor unit or logic circuit executes aplurality of filters for state-transitions for a set of protocols.

In some embodiments, the plurality of filters includes a first filterconfigured to identify a LACP PDU (e.g., LACP control PDU) indicating aprotocol state or resource state change of the logical channel, or oneor more links within the channel.

In some embodiments, the plurality of filters includes a second filterconfigured to identify a BPDU indicating a Spanning Tree Protocol (e.g.,MSTP, RSTP) topology-change notification (TCN) message.

In some embodiments, the plurality of filters includes a third filterconfigured to identify a GIR (graceful insertion and removal) operationof a peer network device (e.g., LLDP/CDP DPU, and/or GIR associatedmessage in BGP, OSPF, RIP, EIGRP, and ISIS) (e.g., for load balancing orload sharing configurations).

In some embodiments, the host CPU is configured to pre-compute a filterto monitor for a specific protocol or resource state transition in areceived packet during the unavailable state of the host CPU.

In some embodiments, the host CPU is configured to pre-compute updateddata-plane entries to redistribute traffic to update the data-plane whenthe filter is matched.

In some embodiments, the processor unit or logic circuit is configuredto monitor for a specified protocol state transition in a receivedcontrol packet; and update the data-plane with pre-computed data-planeentries when specified protocol state transition is detected.

In some embodiments, the processor unit or logic circuit is configuredto identify a received LACP PDU (e.g., LACP control PDU) indicating adown-channel link of a peer network device (e.g., event flag at address1 of the actor-state field); and update the data-plane that a linkaggregation channel associated with peer network device is down (e.g.,by writing a state value to an address in the data-plane associated withthe peer network device).

In some embodiments, the processor unit or logic circuit is configuredto identify a received LACP PDU (e.g., LACP control PDU) indicating astate change of the logical channel, or one or more links within thechannel; and update the data-plane of a disabled-state of a failed-portassociated with the peer network device based on pre-computed hash of amodified local ether-channel to redistribute traffic on other memberslinks based on a set of active links.

In some embodiments, the processor unit or logic circuit is configuredto identify a received BPDU indicating a Spanning Tree Protocol (e.g.,MSTP, RSTP) topology-change notification (TCN) message; and update thedata-plane to move a port that received the TCN message to a blockingstate.

In another aspect, a method is disclosed comprising the steps ofperforming, by a host CPU, control-plane operations that manage andmaintain a plurality of data-plane-associated tables (e.g., L2 MACtable; MAC learning tables; L3 tables; RIB, FIB, etc.) of aswitch-fabric of the network device; computing, by the host CPU, aplurality of filters to identify protocol state and/or resource statetransitions associated with the control plane operations; transmitting,by a host CPU, to a processor unit or logic circuit (i.e., a non-hostCPU component, e.g., logic circuits of NPU, RP, ASIC, switching-FPGA, ora core located therein, remote device), the plurality of computedfilters; receiving, by the processor unit or logic circuit, theplurality of transmitted filters; tracking, via the plurality ofplurality of received filters, implemented in a data plane component ora secondary processing unit, protocol state and/or resource statetransitions of the control-plane (e.g., during the unavailable,overloaded state of the control-plane or as a normal course of operationin parallel to the host CPU), wherein the tracked protocol state and/orresource state transitions tracked by the plurality of received filtersare used to update associated data plane resources.

In another aspect, a non-transitory computer readable medium isdisclosed having instructions stored thereon, wherein execution of theinstructions by a first processor comprising a processor unit/logiccircuit to receive, from a data-plane interface, a plurality of filtersto identify protocol state and/or resource state transitions associatedwith control plane operations, wherein the plurality of filters has beenpre-computed by a host CPU or external CPU configured to performcontrol-plane operations that manage and maintain a plurality ofdata-plane-associated tables (e.g., L2 MAC table; MAC learning tables;L3 tables; RIB, FIB, etc.) of a switch-fabric of the network device; andtrack, via the plurality of received filters, implemented in a dataplane component or a secondary processing unit, protocol state and/orresource state transitions of the control-plane (e.g., during theunavailable, overloaded state of the control-plane or as a normal courseof operation in parallel to the host CPU), wherein the plurality ofdata-plane associated tables of the data plane are updated by the hostCPU or the processor unit or logic circuit based on the tracked protocolstate and/or resources.

Example System

FIG. 1A is a diagram of an examplary network device 100 (shown as 100 a)configured with a protocol state transition and/or resource statetransition tracking module 200 (see, e.g., FIGS. 2A, 2B, 2C) (alsoreferred to as a protocol state machine transition tracker and aresource state machine transition tracker, respectively) in accordancewith an illustrative embodiment. The protocol state transition and/orresource state transition tracking module 200 is configured to monitorfor changes in protocol states represented in routing and/or forwardingtables of the data-plane and/or changes in hardware resource states ofthe network device.

In FIG. 1A, the network device 100 a is configured as a network switchand is shown comprising a plurality of ports 102 coupled to forwardingengine implemented in a route or network processor 104 via a busstructure 106 (shown as “switch fabric” 106). Route or networkprocessors 104 can be used to execute routing protocols, e.g., bymaintaining routing information and forwarding table(s). The route ornetwork processor 104 may have access to fast memory 108 (such asternary content-addressable memory (TCAM), CAM, SRAM, buffers, etc.) andlocal memory 110 (e.g., dynamic random-access memory (DRAM), SRAM)).

The route or network processor 104 may communicate with a host processor105 (also referred to herein as a host CPU and shown as “HostProcessor(s)” 105). As discussed above, a host CPU generally refers to acore of a microprocessor or microcontroller, e.g., having RISC or CISCarchitecture, that is configured to execute general computerinstructions (i.e., applications, middleware) within the framework of anoperating system. Here, computer instructions generally refer to generalinstructions, preferably, that are prepared not to be specifically tiedto a particular computer architecture. The host CPU 105 has a businterconnect 132 (e.g., PCI or PCIe (PCI-express) bus) that serves as adata plane interface to the route or network processors 104 and/or othercomponents of the data-plane. PCIe can refer to PCI-X, PCI-express 16x,PCI-express 1x, and the like. Examples of other bus interconnect is theAGP (accelerated graphics port) bus. In some embodiments, the host CPU105 and route/network processors 104 are co-located on a samesupervisory card 114. In yet other embodiments, the host processor 105is used as a substitute for, or integrated with, the route or networkprocessor 104 or components thereof, e.g., in a network-on-a-chip (NoC).The bus interconnect 132 provides connectivity between the host CPU 105and the data plane 136.

In FIG. 1A, the route/network processors 104 is shown to connect toinline cards 112 (shown as 112 a, 112 b, 112 c, and 112 d) through theswitch fabric 106. Switch fabric may be embodied as a cross-bar switchconfigured to interconnect a plurality of serial channel port interfacesto establish point-to-point wire connections for switching frames amongthe line cards of the switch.

In FIG. 1A, in some embodiments, the ports 102 are shown located on aplurality of in-line cards 112 (shown as 112 a, 112 b, 112 c, and 112 d)and the forwarding engine (i.e., route/network processor 104) is locatedon a supervisor card 114. Each in-line card 112 may include one or moreASIC(s) 116, memory and memory-like resources 118 (e.g., CAM, registers,buffers, and driver 120) to route a frame received at one of its port toanother port or to route the frame to the switch fabric 106 to otherports in the network switch. Other configurations and implementationsmay be implemented. An “ASIC” as used herein may refer to a customizedapplication specific integrated circuit as well as configurableintegrated circuit such as field-programmable gate array (FPGA) andcomplex programmable logic device (CPLD).

Broadly stated, when a frame (also referred to as a packet) is receivedat a port 102 at the line card, the frame is driven over an internal busof the line card 112 based on forwarding decision rendered by the ASIC116 (or local processor) located in the line card or is driven over theswitch fabric 106 to other ports based on forwarding decision renderedby the forwarding engine. Such frames are processed by the data plane(also referred to as the forwarding plane, among other) of the networkdevice. In FIG. 1A, the data-plane 136 is shown as any component andassociated resources involved in the forwarding and routing of usertraffic. The data-plane (e.g., forwarding engine) renders the forwardingdecision by accessing a forwarding or routing table to look-up adestination MAC address of the frame. Frames associate with the controlplane (e.g., those associated layer-2 and/or layer-3 control protocolsuch as Spanning Tree Protocol (STP), Open Shortest Path First (OSPF),Multiprotocol Label Switching (MPLS), Internet Group Management Protocol(IGMP), Intermediate System to Intermediate System (IS-IS), BorderGateway Protocol (BGP), PIM, Enhanced Interior Gateway Routing Protocol(EIGRP), Routing Information Protocol (RIP), virtual LAN (VLAN), VirtualExtensible LAN (VxLAN), etc.) and management plane (e.g., associatedwith telnet, command line interface (CLI), file transfer protocol (FTP),trivial file transfer protocol (TFTP), syslog, secure shell (SSH),simple network management protocol (SNMP), Hypertext Transfer Protocol(HTTP), HTTP Secure (HTTPS), access control lists (ACL), etc.) may alsobe received at the ports but are generally routed to the ASICs or to theroute or network processor 104 to update control and managementoperation of the network device 100 (e.g., 100 a, 100 b, etc.).

The network device 100 (e.g., 100 a) may include, as shown in FIG. 1A,additional cards 122 comprising processors 124 and memory 126 to performother control or supervisory operations of the network device 100 (e.g.,100 a). In some embodiments, the additional cards 122 (as well as in thesupervisory card 114) may be implemented in general-purpose or specialpurpose computing devices environments, virtual network environment, orconfigurations. Components on the additional cards 122 may be connectedto other components via the bus interconnect 132 or the switched fabric.The bus interconnect 132 also may allow the host CPU 105 to connect tothe data-plane 136 via host CPU driver 134.

Computer-executable instructions, such as program modules, beingexecuted by a computing device (e.g., via the host CPU) may be used.Generally, program modules include routines, programs, objects,components, data structures, etc. that perform particular tasks orimplement particular abstract data types. Computer-executableinstructions may execute the Protocol and/or Resource State TransitionTracker functionality to be discussed below.

Distributed computing environments may be used where tasks are performedby remote processing devices that are linked through a communicationsnetwork or other data transmission medium. In a distributed computingenvironment, program modules and other data may be located in both localand remote computer storage media including memory storage devices.

Computing device typically includes a variety of computer readablemedia. Computer readable media can be any available media that can beaccessed by the device and includes both volatile and non-volatilemedia, removable and non-removable media. Computer readable media may beused to store executable instructions for Protocol and/or Resource StateTransition Tracker functionality to be discussed below. Computer storagemedia include volatile and non-volatile, and removable and non-removablemedia implemented in any method or technology for storage of informationsuch as computer readable instructions, data structures, program modulesor other data. Memory, removable storage, and non-removable storage areall examples of computer storage media. Computer storage media include,but are not limited to, RAM, ROM, electrically erasable programread-only memory (EEPROM), flash memory or other memory technology,CD-ROM, digital versatile disks (DVD) or other optical storage, magneticcassettes, magnetic tape, magnetic disk storage or other magneticstorage devices, or any other medium which can be used to store thedesired information, and which can be accessed by computing device. Anysuch computer storage media may be part of computing device.Computer-executable instructions and computer storage media are wellknown in the art and are not discussed at length here.

Computing device may contain communication connection(s) that allow thedevice to communicate with other devices. Computing device may also haveinput device(s) such as a keyboard, mouse, pen, voice input device,touch input device, etc. Output device(s) such as a display, speakers,printer, etc. may also be included. All these devices are well known inthe art and are not discussed at length here.

The instant protocol state transition and/or resource state transitiontracking module may be deployed in various network device. FIG. 1B is adiagram of another network device 100 (shown as 100 b) configured with aprotocol state transition and/or resource state transition trackingmodule (e.g., 200) in accordance with an illustrative embodiment. InFIG. 1B, the network device 100 b is configured as a fixed-configurationswitch. As shown, the switching components (e.g., 116, 118, and 120),supporting data-plane components (e.g., 104, 108, 130) and control-planecomponents (e.g., 105, 110, 134) are integrated into one or more boards.Because of the limited redundancy of supporting data-plane components,implementations of the protocol state transition and/or resource statetransition tracking module in such systems (e.g., 100 b) can beparticularly beneficial to overall system uptime.

Example Protocol State Transition and/or Resource State TransitionTracker FIG. 2A shows an exemplary network device 100 (shown as 100 c)configured with a protocol state transition and/or resource statetransition tracker module 200 (shown as 200 a) in accordance with anillustrative embodiment. A module may include a software application,firmware, middleware, preconfigured logic function of configurablehardware (IP), or a combination thereof. The protocol state transitionand/or resource state transition tracker module 200 may be implementedin a processor unit or logic circuit (PULC) that is a non-host CPUcomponent, including, for example, logic circuits or processing units ofa network processing unit (NPU), routing processor (RP), ASIC,switching-FPGA, or processing core(s) located therein, as well as remotedevices (e.g., OpenFlow controller).

As shown in FIG. 2A, the control-plane (shown as host processor 105)installs filtering rules 204 (shown as “Filter/Rules” 204) in theprotocol state transition and/or resource state transition trackermodule 200 (shown as 200 a) which, when operating in concert with thefiltering rules 204, is configured to match for certain control-planestate-transition messages (see, e.g., FIGS. 6, 7, 8 ) for a set ofprotocols as defined in the filters 204. A filtering rule 204 has one ormore corresponding action instructions or sequences 206 (shown as“Action/Rules” 206) that may be executed upon a match of filterspecified in the filtering rule 204. The action instructions/sequences206 may be installed in the host CPU, in data plane components, or in asecondary processing unit, to perform corresponding actions to updatethe data plane upon a filter being matched. In some embodiments, thefiltering rules 204 and/or corresponding action instructions/sequences206 are pre-computed by the host processor 105. In some embodiments, thefiltering rules 204 are pre-computed by the host processor 105, and thecorresponding action instructions/sequences 206 are subsequentlycomputed when a given filtering rule is matched. In other embodiments,the filtering rules 204 and/or corresponding actioninstructions/sequences are computed by a remote controller (e.g.,OpenFlow controller (not shown; see FIG. 3 )) and are transmitted to thenetwork device 100 (e.g., 100 a, 100 b, 100 c, etc.).

Specifically, in some embodiments, the protocol state transition and/orresource state transition tracker module 200 (e.g., 200 a, 200 b, 200 c)is configured to scan through a set of fields in a protocolcontrol-plane message, e.g., for updates to the protocol state. Theprotocol state transition aspect of the tracker module 200 (e.g., 200 a,200 b, 200 c), in some embodiments, is configured to scan for specificvalues in the fields and flag when a match for a specific value in thefield of interest is found. The filtering logic may be implementedsolely in, or through a variety of, hardware blocks, for example, in ACLTCAM, packet classification engine, deep packet inspection engine,packet parser, among the like.

For example, for certain hardware resource state transitions where aprotocol packet is received, filtering logic may be implemented solelyin, or through a variety of, hardware blocks, for example, in ACL TCAM,packet classification engine, deep packet inspection engine, packetparser, and the like. However, for certain hardware resource statetransitions that does not have an associated packet/frames, an embeddedmicro-controller or other logic circuit may be used to implement aportion of the protocol state transition and/or resource statetransition tracker module 200 (e.g., 200 a, 200 b, 200 c) to track suchhardware resource state transitions.

In addition to specific protocol messages, the protocol state transitionand/or resource state transition tracker module 200, in someembodiments, is also configured to track other events that may impactthe forwarding topology (e.g., link down). In some embodiments, theresource state transition tracker aspect of the module 200, whenoperating in concert with the filtering rules, is configured to matchfor certain resource state-transition signals or messages (not shown;see FIG. 10 ) for a set of hardware resource as defined in the filters.

Indeed, once filtering rules 204 are configured in the protocol statetransition and/or resource state transition tracker module 200 (e.g. 200a, 200 b, 200 c), based on the rules, the module 200 can track/flagevents and/or state transitions (e.g., protocol state transition orresource state transition) that could impact the forwarding topologywhile the data-plane components are running headless (e.g., without thehost CPU 105). The tracked state (once identified and/or matched) areused to update the data-plane (such as shutting down an adjacency,blocking a port, update forwarding/routing tables, etc.) to minimize thenegative impact on the network and, in some embodiments, allow thenetwork device to be kept running while the host CPU is unavailable.More elaborate control-plane actions, such as renegotiation,re-convergence etc. may be subsequently performed after thecontrol-plane is fully or partially functional. In some embodiments, thetracked state (once identified and/or matched) are used to update thedata-plane while the control plane (e.g., host CPU) is unavailable (oroverloaded), e.g., by protocol state transition and/or resource statetransition tracker module 200 or a module (e.g., an update module or asecondary processing unit) operating in conjunction with the protocolstate transition and/or resource state transition tracker module 200. Inother embodiments, the tracked state are used to update to the dataplane by the control plane following the transition of the control plane(e.g., in a primary or secondary tread of the host CPU) from theunavailable (or overloaded) state to an available state.

The protocol state transition and/or resource state transition trackingmodule 200 (e.g., 200 a, 200 b, 200 c) may be implemented, in part, orwhole, in data plane associated components or external device componentsthat can be configured and reconfigured to implement a filter (e.g.,that can match a set of packet-header fields to the desired values(rules)). The data plane associated components or external devicecomponents may be entirely hardware-based (e.g., reconfigurable logicand/or tables), entirety software-based, or a combination of bothhardware-based and software-based filtering. Examples of data planeassociated components or external device components include, but are notlimited to, hardware or software modules that are configured, in whole,or in part, as a packet classification engine, a deep packet inspectionengine, a packet parser, ACL TCAMs, among others. In some embodiments,the protocol state transition and/or resource state transition trackingoperations are implemented in multiple modules.

The protocol state transition and/or resource state transition trackingmodule 200 (e.g., 200 a, 200 b, 200 c) preferably is configured tooperate independently of the host CPU. In some embodiments, the protocolstate transition and/or resource state transition tracking module isconfigured to execute filters when the host CPU is not available or isoverloaded. In other embodiments, for example, where the protocol statetransition and/or resource state transition tracking module is used forload sharing operation, the protocol state transition and/or resourcestate transition tracking module is configured as a co-processor orsecondary processing unit or the like (e.g., in the control-plane or inremote components) that operate in conjunction with the host CPU.

As shown in FIG. 2A, in some embodiments, the network device 100 (e.g.,100 c) is configured to operate with a protocol state updater 208 (shownas 208 a) that performs, using the actions instructions/sequences 206,the update to the data-plane (shown as data-plane forwarding/routingtables 210) while the control plane (e.g., host CPU 105) is unavailable.The protocol state updater 208 (e.g., 208 a) may be a part of theprotocol state transition and/or resource state transition trackingmodule 200. In other embodiments, the protocol state updater 208 (e.g.,208 a) is implemented in other components of the data plane. Theprotocol state updater 208 (e.g., 208 a) may retrieve tracked statetransitions from a table or database 212 (shown as “Tracked transitions”212 a)) maintained and/or populated by the protocol state transitionand/or resource state transition tracking module 200 (e.g., 200 a). Inother embodiments, the protocol state updater 208 (e.g., 208 a) receivestracked state transitions, associated with a matched filter 204, fromthe protocol state transition and/or resource state transition trackermodule 200.

In some embodiments, the protocol state transition and/or resource statetransition tracking module 200 (e.g., 200 a, 200 b, 200 c) is configuredto execute filtering rules for protocol state transitions and/orfiltering rules for resource state transitions and to update a hitcounter upon each match. In some embodiments, the protocol statetransition and/or resource state transition tracking module 200 (e.g.,200 a, 200 b, 200 c) is configured to update a hit flag (rather than acounter) that indicates a match of a given filter. The hit counter orhit flag has an associated address in the table or database 212 to whichthe protocol state updater 208 (e.g., 208 a, 208 c), for example, mayscan to take actions. In some embodiments, the hit flag or hit counterfor a set of filters may be assigned a set of addresses to which thefilter (e.g., a TCAM and associated logic) can update. To this end, theprotocol state updater 208 (e.g., 208 a, 208 c) may scan a set of valuesin the table or database 212 to identify if there are updates to whichthe updated 208 (e.g., 208 a, 208 c) can act. Of course, other dataformats and information may be stored as, or in addition, to a hitcounter or hit flag. For example, in some embodiments, a hit counter orhit flag may be updated along with addresses for corresponding actioninstructions or sequence. In some embodiments, the hit counter or hitflag may be updated along priority information associated with a givenfilter.

In other embodiments, the protocol state transition and/or resourcestate transition tracking module 200 is configured to execute filteringrules for protocol state transitions and/or filtering rules for resourcestate transitions and send a match event to a queue of the protocolstate updater 208. The protocol state updater 208 (e.g., 208 a, 208 c)may then take associated action to a given matched filter based on thematch event information in its queue.

Pre-computed filters (e.g. 204) may be calculated and stored in volatileor non-volatile memory, e.g., of the data plane components.Corresponding action instructions/sequences (e.g., 206) may be stored innon-volatile memory, e.g., in embodiments in which a thread in the hostCPU is used to perform an update to the data plane resource. In someembodiments, the action instructions/sequences (e.g., 206) may be storedin volatile memory where the instructions (e.g., 206) are executed ondata plane components or secondary processing units.

FIG. 2B shows an exemplary network device 100 (shown as 100 d)configured with a protocol state transition and/or resource statetransition tracker module 200 (shown as 200 b) in accordance withanother illustrative embodiment. In FIG. 2B, filters 204 for protocoltransition states and/or for resource transition states are computed,e.g., by the host CPU 105, and installed into the protocol statetransition and/or resource state transition tracker module 200 (e.g.,200 b), e.g., as discussed in relation to FIG. 2A, and the protocolstate transition and/or resource state transition tracker module 200 bis configured execute the filter to track/flag events and/or statetransitions (e.g., protocol state transition or resource statetransition) that could impact the forwarding topology while thedata-plane components are running headless (e.g., without the host CPU105) and store the tracked transitions. Corresponding actioninstructions/sequences 206 for a given filter 204 may be computed, e.g.,by the host CPU 105, concurrently with the computing of the filters 204.

In FIG. 2B, the protocol state transition and/or resource statetransition tracker module 200 (e.g., 200 b) is configured to match astate transition (e.g., match values in sets of packet-header fields topre-defined filter rules) and stored any determined matches in a tableor database 212 (shown as tracked transitions 212 b). In someembodiments, the table or database 212 (e.g., 212 b) is a part of thecomponent executing the protocol state transition and/or resource statetransition tracker module 200. The table or database 212 (e.g., 212 b)may then be accessed, e.g., by the host CPU 105 once it is available, toperform an update of the state transition to the data-planeforwarding/routing tables 210 using an appropriate actioninstruction/sequence 206 corresponding to a given matched filter.

FIG. 2C shows an exemplary network device 100 (shown as 100 e)configured with a protocol state transition and/or resource statetransition tracker module 200 (shown as 200 c) in accordance withanother illustrative embodiment. In FIG. 2C, filters 204 for protocoltransition states and/or for resource transition states are computed,e.g., by the host CPU 105, and installed into the protocol statetransition and/or resource state transition tracker module (e.g., 200c), e.g., as discussed in relation to FIGS. 2A and 2B, and the protocolstate transition and/or resource state transition tracker module (e.g.,200 c) is configured execute the filter to track/flag events and/orstate transitions (e.g., protocol state transition or resource statetransition) that could impact the forwarding topology while thedata-plane components are running headless (e.g., without the host CPU105) and store the tracked transitions. Corresponding actioninstructions/sequences 206 for a given filter 204 may be computed, e.g.,by the host CPU 105, concurrently with the computing of the filter andinstalled into the protocol state updater (e.g., 208 c). Then, theprotocol state transition and/or resource state transition trackermodule (e.g., 200 c) is configured to match a state transition (e.g.,match values in sets of packet-header fields to pre-defined filterrules) and to push a matched/determined state transition (as a signal ormessage) to the protocol state updater 208 (shown as 208 c). In someembodiments, the matched/determined state transition is pushed into aqueue of the protocol state updater 208 (e.g., 208 c). Thematched/determined state transition includes, at least, a filteridentifier corresponding to a filter to which corresponding actioninstructions can be identified and retrieved. The protocol state updater208 (e.g., 208 c) then performs an update of the state transition to thedata-plane forwarding/routing tables 210 according to the actioninstructions/sequences 206 for a given matched filter.

In some embodiments, the protocol state updater 208 (e.g., 208 c)performs the update while the host CPU 105 and/or control plane isunavailable. In other embodiments, the protocol state updater 208 (e.g.,208 c) is configured to perform the update in parallel to control planeoperation executing on the host CPU 105.

Fast Upgrade Application Using Example Protocol State Transition and/orResource State Transition Tracker

FIG. 3 shows an examplary network device 100 (shown as 300) configuredwith the protocol state transition and/or resource state transitiontracker module 200 of FIG. 2A, 2B, or 2C in accordance with anillustrative embodiment.

In FIG. 3 , the network device 300 comprises a protocol state transitionand/or resource state transition tracker module 200 (shown as 200 d)that is configured to operate with fast upgrade application 302 tofacilitate the tracking of protocol state or resource state transitionswhen the host CPU is unavailable for fast upgrade operation (alsoreferred to herein as a Fast Software Upgrade (FSU) operation). One ormore instances of the protocol state transition and/or resource statetransition tracker module 200 d may be implemented or instantiated.

During software upgrade operations, the control plane may be disabled(for the upgrade) and the data-plane may be allowed to run headless.Since the upgrade process may take a few minutes (˜5 minutes) toperform, the data-plane may be running with a stale forwarding topologyin which protocol and resource state changes are not acted upon duringthe duration of the upgrade. Depending on the condition changes, networkand/or security issues may arise (e.g. spanning tree loop leading toflooded traffic storm) and thus software upgrade operations are oftenperformed with the entire network device coming off-line. In someembodiments, e.g., where the network device is used for control andautomation and/or where the network device does not have redundancy(e.g., standby modules), this may create massive disruptions to a givenreal-time control operation.

Conventionally, switching and routing systems often implement softwareupgrade operation by implementing some form of redundancy, e.g., usingbackup modules. For example, route processor may be switched to anactive role, while previously active modules (e.g., host CPU) go throughan upgrade and vice-versa. This topology is often referred to asIn-Service Software Upgrade (ISSU). For non-redundant, standaloneswitching systems that lack such back-up modules (often deployed inaccess application) (e.g., as shown in FIG. 1B as well as in FIG. 1Awhere standby modules are not installed), the process of softwareupgrade may be intrusive where there is only one host CPU in the system.

The instant protocol state transition and/or resource state transitiontracker module 200 (e.g., 200 a-200 e) facilitates the tracking ofcondition changes that can prevent the bringing down of the systemduring the duration of upgrade. Indeed, the protocol state transitionand/or resource state transition tracker module 200 (e.g., 200 a-200 e)may be used in combination with fast software upgrade (FSU) applicationto provide continuous data-plane update services, or the tracking ofprotocol state changes or hardware resource changes when the host CPU isunavailable, to minimize the impact on the data forwarding services ofthe network device. In some embodiments, the instant protocol statetransition and/or resource state transition tracker module 200 (e.g.,200 a-200 e) may provide near continuous operation with minimaldisruption (e.g., less than a second, often in the order ofmilliseconds, disruption) to network devices to improve the overalluptime of the network device. The instant network may do so bydecoupling the data-plane and the control-plane and, in essence, lettingthe data-plane continue to run independently of the hostCPU/control-plane (referred to herein and often described as“headless”). During the headless operation, the data-plane generallyruns with old forwarding state (e.g. MAC addresses, IP routes)information, but with updates tracked by the instant protocol statetransition and/or resource state transition tracker module 200 (e.g.,200 a-200 e). In some embodiments, the host CPU (e.g., 105) may bebrought down (i.e., disabled and/or made unavailable), reloaded, andupgraded with a new version of software upgrade.

Fast upgrade operations may operate in conjunction with graceful restartmechanisms (e.g., as described in IEFT RFC 5187) that address to reducethe impact of the software upgrade by informing its peers ahead of time.Graceful restart mechanisms may alleviate fast upgrade issues for layer3 protocols and their corresponding state machine. In conjunction withthe fast upgrade operations, the network device can additionally addresslayer 2 updates that prevent, or reduce, forwarding of packets toincorrect destinations, creation of network loops, delayed networkconvergence in other systems, and/or security vulnerabilities.

In some embodiments, the network device (e.g., 100) is configured as anon-redundant, fixed configuration switching systems. In someembodiments, the network device (e.g., 100) is configured as aredundant, fixed configuration switching systems. In some embodiments,the network device (e.g., 100) is configured as a non-redundant, modularswitching systems. In some embodiments, the network device (e.g., 100)is configured as a redundant, modular configuration switching systems.In other embodiment, the network device may be routers or othernetworking systems (e.g., having fixed or modular configurations and/orhaving redundant or non-redundant data-plane support components).

FIG. 5A shows an exemplary method of tracking protocol state and/orresource state transitions of the control-plane (e.g., during theunavailable, overloaded state of the control-plane, or as a normalcourse of operation in parallel to the host CPU) in accordance with anillustrative embodiment. FIG. 5B shows an exemplary method of trackingprotocol state and/or resource state transitions of the control-plane(e.g., during the unavailable, overloaded state of the control-plane, oras a normal course of operation in parallel to the host CPU) inaccordance with another illustrative embodiment. Referring to FIG. 5A or5B (and FIG. 3 ), when the software upgrade operation is initiated, thecontrol-plane determines (step 502) a set of filtering rules andinstalls (step 504) filtering rules in the protocol state transitionand/or resource state transition tracker module 200 d, e.g., in thedata-plane. In some embodiments, the filtering rules maybe derived, orreceived, from a network interface that provides for communication witha remote controller 306 (shown as “OpenFlow Controller” 306 a). In someembodiments, the control-plane determines (step 502 b of FIG. 5B) a setof corresponding action instructions/sequences concurrently with thecomputation of the filtering rules. In other embodiments (e.g., FIG.5B), the control-plane determines a corresponding actioninstructions/sequence once a filter has been installed and matched. Theaction instructions/sequences may be stored in software, e.g., innon-volatile memory for execution by the host CPU 105 or in data planecomponents, e.g., executing a protocol state updater (e.g., 208 a, 208c).

The filters 204 (e.g., 204 a-204 d), in some embodiments, provide forthe matching of state-transition messages for a set of protocols (e.g.,those described herein). In some embodiments, for example, when aprotocol's state update is communicated through a set of fields in thatprotocols' messages, the protocol state transition and/or resource statetransition tracker module (e.g., 200 d) is configured to look (step 506)for, e.g., scan, specific values in the fields and flags when a match isidentified. That is, scanning values in fields and flags of receivedprotocol messages may be automatically performed in thePacket-Inspection Engine or TCAM block.

In addition to specific protocol messages, the protocol state transitionand/or resource state transition tracker module 200 d may also trackother events that may impact the forwarding topology (e.g., link down).The filtering logic may be implemented through a variety of hardwareblocks (e.g., ACL TCAM). Any required resources are reserved at systemstartup.

Once the rules are configured in the protocol state transition and/orresource state transition tracker module 200 d (e.g., in thedata-plane), events that could impact the forwarding topology areflagged while the system (data-plane) is running headless. The trackedinformation is then used to perform the necessary updates (step 508) tothe data-plane (such as shutting down an adjacency, blocking a portetc.) to minimize the negative impact on the network. More elaborateactions, such as renegotiation, re-convergence etc. may be performedafter the control-plane is fully functional.

During the software upgrade process, the protocol state transitionand/or resource state transition tracker module 200 d may monitor (e.g.,step 506) events that could impact the forwarding topology and, in someembodiments, apply control-plane associated correction and/or updates(e.g., step 508) as a result of such events. In some embodiments, perstep 508, the protocol state transition and/or resource state transitiontracker module 200 d may perform staggered data-plane update that may beperformed as simple processes running on the host CPU while the systemis recovering (see, e.g., FIG. 12 ). In addition, the protocol statetransition and/or resource state transition tracker module 200 d may beimplemented in available computation resource, e.g., in the data-planeto facilitate more frequent updates. Example of available resourcesinclude microcontrollers in the data-plane itself, e.g., microprocessorcores on certain ASICs in a switch gear hardware.

As shown in FIG. 3 , the protocol state transition and/or resource statetransition tracker module (e.g., 200 d) is shown coupled to a data planeinterface 304 (e.g., bus interconnect 132) that interfaces to the hostCPU 105 executing a fast upgrade application 302. The host CPU 105 alsoexecutes control-plane operations that manages and maintains a pluralityof data-plane-associated tables (e.g., L2 MAC table; MAC learningtables; L3 tables; RIB, FIB, etc.), shown in FIG. 3 as resources 210a-210 d, of the network device 100 (e.g., 100 a-100 e). The fast upgradeoperations 302 (e.g., FSU) provide instructions to the host CPU 105 topre-compute filters 204 to install on the protocol state transitionand/or resource state transition tracker module 200 d (shown as “Filter1” 204 a, “Filter 2” 204 b, “Filter 3” 204 c, and “Filter n” 204 d). Thefilters (e.g., 204 a-204 d) executing on the protocol state transitionand/or resource state transition tracker module (e.g., 200 d) facilitatethe tracking of protocol state transitions and/or resource statetransitions when the host CPU 105 is unavailable, e.g., during a fastsoftware upgrade operation (e.g., FSU).

FIG. 4 shows an exemplary network device 100 (shown as 400) configuredto perform updates to data plane resources during a software upgradeoperation, e.g., as described in relation to FIG. 3 , in accordance withan illustrative embodiment. In FIG. 4 , the protocol state transitionand/or resource state transition tracker module 200 (shown as“Filters/Classification Engine” 200 e) is configured to scan throughsets of fields in received protocol control-plane messages. The filters204 a-204 d may be generated by the host CPU 105, e.g., executing thefast upgrade application 302 (shown as “Protocol State and/or hardwarestate tracking application”) 402 and installed in theFilters/Classification Engine 200 e through the data-plane interface(shown as “Write filters” 408). The data-plane interface 304 may be adata-plane access driver that provides access to the data-plane devices(e.g., NPU, Switching-ASIC) and data-plane resources by the forwardingapplication and/or engine 404. Upon a match, the protocol statetransition and/or resource state transition tracker module 200 e isconfigured, in some embodiments, to store a matched event (410), e.g.,as a hit counter or a hit flag, to a table or database 212 (shown as 212c). Subsequently, the table or database (e.g., 212 c) may be accessed bythe control plane, e.g., executed by the host CPU 105 (shown asforwarding applications 404) executing the instructions associated withthe Protocol State and/or hardware state tracking application 402. Theforwarding application 404 is configured to manage and maintain aplurality of data-plane-associated tables (e.g., L2 MAC table; MAClearning tables; L3 tables; RIB, FIB, etc.) of a switch-fabric of thenetwork device 100 (e.g., 400) by performing reading and writingoperations to such resources (shown as “Write resource” 410 and “Readresource” 412) through the data-plane interface 304.

In other embodiments, the protocol state transition and/or resourcestate transition tracker module 200 e is configured, upon determining amatched event (e.g., matched field(s) in a received protocolcontrol-plane message), to send a command (414) to an update agent 406(e.g., 208 a, 208 c) configured to perform an update to the intendedrouting or forwarding resource (shown as 210 a, 210 b, 210 c, and/or 210d). The update agent 406 may include a queue (e.g., FIFO) to which thecommand is processed in accordance with the sequence it is received. Insome embodiments, the command may include an identifier of a matchedfilter from a plurality of filters to which corresponding actionsequences/instructions may be identified and/or retrieved. In FIG. 4 ,the update agent 406 is configured to write (416) the appropriate dataplane resource 210 using addresses of the data plane resource asidentified in an action instruction corresponding to the matched filter.

Example Methods of Operation of Fast Upgrade Operation

FIG. 6 shows an examplary timing diagram 600 of a method of executingfast upgrade operations in a network device configured with an exemplaryprotocol state transition and/or resource state transition trackermodule 200 (shown as “Transition State Tracker” 602 a) in accordancewith an illustrative embodiment. In FIG. 6 , prior to a fast upgradeoperation (e.g., FSU), data-plane and control-plane associated packets(shown as 603 a for data associated packets and 603 b for control-planeassociated packets) received at ports 102 (shown as “Port(s)” 102 a) ofa network device (e.g., 100) are routed to appropriate ports (also shownas 102 a) of the device through switch-gear hardware and/or switchfabric (shown as “ASIC/Switch Fabric” 604) (e.g., corresponding to 106,112, 116, etc.). As shown in FIG. 6 , data packets 603 a are switched(sequence 606) through the ASIC/Switch Fabric (604), which may accessrouting/forwarding table(s) 210 a, and control-plane associated packets603 b (in sequence 614 a) are directed (608) to the control planeexecuted, in part, by the host CPU 105 (shown as “Host CPU (forwardingapplication” 105 a) that parses (610) the control-plane associatedpackets 603 b to update (612) the appropriate routing/forwardingtable(s) 210 a of determined changes.

Referring to FIG. 6 , upon receiving a command (616) to initiate a fastupgrade operation from a fast-upgrade application 302 (shown as“State-Tracking application” 302 a), the state-tracking application 302a is configured, in some embodiments, to send a notification message 618to the ASIC/Switch Fabric (604) that directs the ASIC/Switch Fabric(604) to relay subsequent received control-plane associated packets(e.g., 630) to the protocol state transition and/or resource statetransition tracker module 200 (shown as the “transition state tracker”602 a), e.g., in addition to relaying it the data-plane interface 304.The state-tracking application 302 a also directs the host CPU 105 a tocompute (620) a set of filters 204 that are then installed (shown as 622a and 622 b) onto the transition state tracker 602 a. Examples offilters 204 are provided in FIGS. 9 and 10 , which is subsequentlydiscussed herein. In some embodiments, corresponding actioninstructions/sequences 206 are also computed in step 620 along with thecomputation of the filters 204.

Referring still to FIG. 6 , upon the transition state tracker 602 abeing configured with the set of filters 204, the host CPU andforwarding application 105 a is subsequently disabled and thus madeunavailable (shown as 624) for control plane update.

As shown in FIG. 6 , during the unavailable period (624) of theforwarding application 105 a, data packets 626 are switched (sequence628) through the ASIC/Switch Fabric (604), in essence “headless” withrespect to the forwarding application 105 a. However, control-planeassociated packets 630 (in sequence 632) are directed (634) to thetransition state tracker 602 a that scans (636), for example, theheaders of the packet 630 using address(es) and value(s) provided in thefilters 204. Upon determining a match with a filter, the transitionstate tracker 602 a, in some embodiments, and as shown in FIG. 6 , isconfigured to direct (638) (e.g., an updater (e.g., 208 a, 208 c)) anupdate to the appropriate routing/forwarding table(s) 210 a of thedetermined changes using the appropriate action instructions/sequences206 corresponding to the matched filter. Subsequently, a data packetsubsequently received (640) that relies on that portion of therouting/forwarding table(s) 210 a may be routed (shown in sequence 641)based on the updated control-plane protocol state information (e.g., in630) while the network device 600 is running headless without the hostCPU and/or forwarding application 105 a.

Referring still to FIG. 6 , subsequently (shown in period 642), once thehost CPU and/or forwarding application 105 a becomes available, thestate-tracking application 302 a sends a notification/command(s) (shownas 644 a and 644 b, respectively) to the transition state tracker 602 aand the ASIC/Switch Fabric 604 to notify them that the forwardingapplication 105 a has transition from the unavailable state to anavailable state. The transition state tracker 602 a, in someembodiments, is configured to disable the filtering/classificationoperation. In some embodiments, the transition state tracker 602 auninstalls the filters 204. In some embodiments, the transition statetracker 602 a is un-instantiated and hardware/software resourcesassociated therewith are freed/made available.

With the forwarding application 105 a now available, subsequent receivedcontrol-plane associated packets 603 c (in sequence 614 b) are directed(646 a) to the control plane (105 a) that parses (646 b) thecontrol-plane associated packets to update (646 c) the appropriaterouting/forwarding table(s) 210 a of determined changes.

FIG. 7 shows an examplary timing diagram 700 of another method ofexecuting fast upgrade operations in a network device configured with anexemplary protocol state transition and/or resource state transitiontracker module 200 (also shown as “Transition State Tracker” 702 a) inaccordance with an illustrative embodiment. FIG. 7 shows similaroperations (e.g., 606, 614 a, 612, 618, among others), as described inrelation to FIG. 6 .

However, rather than the transition state tracker 702 a performing theupdates to the data plane resource, FIG. 7 shows the transition statetracker 702 a being configured to store (706) the tracked protocol statetransition (and/or hardware resource state transitions) in a table ordatabase 704. Subsequently, once the host CPU 105 a becomes available,the host CPU 105 a can access (708) and update (710) the trackedprotocol state transition (and/or hardware resource state transitions)from the database to the appropriate data plane resource. Example ofsuch operations is described in relation to FIG. 12 . In addition,examples of filters may be used are described in relation to FIGS. 9 and10 . In some embodiments, the host CPU 105 a may compute correspondingaction instructions/sequences 206 with the filters 204 and store thecomputed action instructions/sequences in persistent memory for lateruse. In other embodiments, the host CPU 105 a is configured to computethe filters 204 and perform computations of appropriate actioninstructions/sequences 206 only for a given matched filter following theupgrade (shown in FIG. 7 as 642 a).

FIGS. 11, 12, and 13 show exemplary methods of operations 1100, 1200,1300 to perform fast software upgrade for certain classes of switchnetwork devices in accordance with an illustrative embodiment.

Specifically, FIG. 11 shows a timing diagram for an example baselinesoftware upgrade operation (1100) for a switch network device in which aprotocol state transition and/or resource state transition trackermodule is not implemented. FIGS. 12 and 13 each shows a timing diagramfor an example fast-software upgrade operation (1200), similar to thoseof FIG. 11 , for a switch network device but one in which the networkdevice is configured with a protocol state transition and/or resourcestate transition tracker module in accordance with an illustrativeembodiment. In FIG. 12 , the protocol state transition and/or resourcestate transition tracker module 200 is configured for staggered trackingoperation with a second thread in the host CPU in which the protocolstate transition and/or resource state transition tracker module 200 isimplemented in a data plane component to monitor for protocol statetransitions while the main thread of the host CPU is unavailable and toprovide the tracked transitions to the secondary tread to update thedata plane resource while the main thread is unavailable. Similaroperations may be performed for hardware resource transitions.Additionally, the main thread executing the operating system on the hostCPU may perform the update the data plane resource once it becomesavailable.

In FIG. 13 , the protocol state transition and/or resource statetransition tracker module 200 is configured to monitor for protocoland/or hardware resource state transitions while the host CPU isunavailable and to provide the provide the tracked transitions tosecondary processing unit (e.g. a system-on-a chip) to update the dataplane resource while the host CPU is unavailable. Indeed, the protocolstate transition and/or resource state transition tracker module 200 maytrack state changes while the control plane is disabled (i.e., and thedata plane is operating headless) to which some update mechanisms can beperformed.

Baseline Fast Software Upgrade

As noted above, FIG. 11 shows a timing diagram for an example baselinesoftware upgrade operation (1100) for a switch network device in which aprotocol state transition and/or resource state transition trackermodule is not implemented. In FIG. 11 , the process 1100 is shownbeginning with the ASIC actively forwarding packets (1102) in normaloperation. Upon receipt of a fast reload command (shown as “reload fast′1104), e.g., for fast upgrade, the software upgrade operation 1100initiates. Peer related operations are first disabled (1106), e.g., viaa graceful restart operation being initiated. Graceful restart involvesa given network device transmitting a message to peer nodes to informits adjacent neighbors and peers that the network device is enteringmaintenance/restart mode. During a graceful restart, the restartingdevice and its neighbors may continue forwarding packets withoutdisrupting network performance.

Once ready, the control plane of the upgrading device is disabled (1108)in that updates to the data plane as a results of protocol statetransitions or hardware state transition are not processed, and the hostCPU is restarted (1110) with a new kernel being loaded. As shown in FIG.11 , the control plane is fully or partially disabled for a period oftime (1112) in that it cannot process any control plane traffic duringthis period, and the system is unaware of link-state or protocol-statechanges. During the software upgrade, the host CPU is booted with adifferent and new kernel/system image. Once the kernel is loaded, theoperating system is booted (1114). In network devices manufactured byCisco Technology, Inc, (San Jose, Calif.), operating system may includePolaris, IOS XE, IOS XR, & IOS Classic (shown in FIG. 11 as“ISO/POLARIS”). Other operation systems maybe similarly rebooted.Following bootup of the operating system, the forwarding application isinitialized (shown as 1118 to 1120) along with various auxiliaryservices performed by the operating system. In some embodiments, cacheand flush operation is performed to create data-plane resource shadows.Example of cache and flush operation is described in U.S. patentapplication Ser. No. 16/542,183, filed Aug. 15, 2019, entitled “DynamicHardware Resource Shadowing,” which is incorporated by reference herein.In some embodiments, cache and flush operation invokes one or moreresource shadow operation(s) that instantiate a shadowing services agentthat creates an instance of a shadow copy of a data plane resource(e.g., MAC table, FIB table, RIB table, ACL table) that can be used torestore the control plane forwarding application once re-initialized. Insome embodiments, the caching operation of the cache and flush operationmay take minutes to perform (e.g., from 1116 to 1122) while the flushingoperation (e.g. initiated at 1122) takes only a few seconds to execute.Subsequently, the host CPU core operation is resumed (1124) (shown as“Data-Plane to Control-Plane Path Restored” 1124), and forwardingapplication is signaled (1126) to continue (shown as “CPU-Bound TrafficResumes” 1126).

In some embodiments, though not shown (e.g., where cache and flushoperation is not used), the control plane upon being initialized isconfigured to calculate and re-populate MAC table(s), FIB table(s), RIBtable(s), ACL table(s), among others. Such processes may be in the orderof minutes.

Fast Software Upgrade with Staggered Tracking Operation

As noted above, FIG. 12 shows a timing diagram for a fast-softwareupgrade operation (1200) in which a protocol state transition and/orresource state transition tracker module 200 is configured for staggeredtracking operation with a second thread executed on the host CPU inwhich the protocol state transition and/or resource state transitiontracker module 200 is implemented in a data plane component to monitorfor protocol state transitions while the main thread executing theoperating system on the host CPU is unavailable and to provide thetracked transitions to the second thread of the host CPU to update thedata plane resource once the main thread of the host CPU is unavailable.

As shown in FIG. 12 , the process 1200 is shown beginning with the ASICactively forwarding packets (1102) in normal operation as shown in FIG.11 . Graceful operation 1106 is performed and updates to the controlplane by the host CPU 105 is disabled (1108) while the host CPU isrestarted (1110) with a new kernel. However, rather than having a period1112 in which the control plane traffic cannot be tracked, FIG. 12 showsa period 1210 in which the data plane component is tracking controlplane traffic (shown as “DP Tracks SM Updates” 1210). To facilitate suchoperation, in FIG. 12 , the host CPU 105 is shown computing andinstalling filters 204 (shown as “Configure Data-Plane to Track TopologyImpacting Events” 1202) in a protocol state transition and/or resourcestate transition tracker module 200 executing in a data-planecomponents. Example methods to compute and install filters 204 aredescribed in relation to FIG. 2A, FIG. 2B, FIG. 2C, FIG. 3 , FIG. 4 ,FIG. 5A, FIG. 5B, FIG. 6 , FIG. 7 , and FIG. 8 . As discussed above, insome embodiments, the protocol state transition and/or resource statetransition tracking module 200 is installed with filters and is executedin a data plane resource/component such as TCAM, ACL, DPI engine, etc.To this end, the protocol state transition and/or resource statetransition tracker module 200 can track control plane traffic duringperiod 1210 while the operating system executing on the host CPU isbooting and unavailable.

FIG. 12 shows a second thread being executed in the host CPU 105 whilethe main thread executing the operating system, which takes longer toboot, is booting. In FIG. 12 , when the new kernel being loaded, asecond thread (or a set of treads) that is a separate independent threadfrom the main thread is also executed in the host CPU 105.Initialization of both the main thread and the second thread is shown as“IOS/Polaris Boots up” 1212. In some embodiments, when the second threadis executed, a minimal set of data-plane drivers is loaded during 1118.To this end, the protocol state transition and/or resource statetransition tracker module 200 executing in the data-plane components cantrack the entirety of when the host CPU and its associated control planeoperation is not operational while minimal control plane operations canbe restored (prior to the full operating system operation) to service,in a staggered manner, state transition updates that were identified bythe protocol state transition and/or resource state transition trackermodule 200 during such periods prior to the main thread of the operatingsystem being fully restored.

In FIG. 12 , state transitions are shown being identified by theprotocol state transition and/or resource state transition trackermodule 200 at time 1204 a, 1204 b, 1204 c, and 1204 d. However, theactual updates to the data plane may be performed at 1204 b, 1204 c, and1204 d once the second thread executing the updater is executed at 1118.

Indeed, the staggered tracking method facilitates the tracking ofprotocol state transition by data plane components when the main threadof the host CPU 105 is unavailable to perform such tracking. In theexample of FIG. 12 , the host CPU 105 performs the update in a secondthread. As discussed herein, other mechanism to perform the update maybe used when the host CPU is available or unavailable. For example,state transition updates that were identified by the protocol statetransition and/or resource state transition tracker module 200 can beupdated by the main thread (rather than the second thread) when theoperating system is fully restored. Additionally, in some embodiments,state transition updates that were identified by the protocol statetransition and/or resource state transition tracker module 200 can beupdated by a secondary processing unit, e.g., as later described inrelation to FIG. 13 .

In addition, FIG. 12 presents the protocol state transition and/orresource state transition tracking module 200 in the context of fastupgrade operation. Indeed, similar operations may be used to loadsharing and load balancing as discussed herein. In combination withcache and flush operation which can restore data plane resource from agenerated shadow copy, the time that the data-plane and control-planeare fully restored (shown as 1208) is indeed substantially reduced ascompared to the unavailable time of the control plane as shown in FIG.11 . In some embodiments, the second thread may provide “alive”response, e.g., to any control plane inquiry, as a proxy to the mainthread executing the operating system.

Fast Software Upgrade with System-on-a-Chip

As noted above, FIG. 13 shows a timing diagram of an examplefast-software upgrade operation 1300 for the switch network device ofFIG. 11 configured with the protocol state transition and/or resourcestate transition tracker module 200 that is configured to monitor forprotocol and/or hardware resource state transitions while the host CPUis unavailable and to provide the provide the tracked transitions tosecondary processing unit (e.g. a system-on-a chip) to update the dataplane resource while the host CPU is unavailable.

As shown in FIG. 13 , similar to the description of FIG. 12 , theprocess 1300 is shown beginning with the ASIC actively forwardingpackets (1102) in normal operation as shown in FIG. 11 . Gracefuloperation 1106 is performed and updates to the control plane by the hostCPU 105 is disabled (1108) while the host CPU is restarted (1110) with anew kernel. However, rather than having a period 1112 in which thecontrol plane traffic cannot be tracked, FIG. 13 shows a period in whichthe data plane component (executing the protocol state transition and/orresource state transition tracker module 200) is tracking control planetraffic (shown as “DP Tracks SM Updates” 1306 a) via filters executingthere at and providing, directly or indirectly, identified trackedtransitions to a secondary processing unit that performs the updates tothe appropriate data plane resource. To facilitate such operation, inFIG. 13 , the host CPU 105 is shown computing and installing filters 204(also shown as “Configure Data-Plane to Track Topology Impacting Events”1202) in a protocol state transition and/or resource state transitiontracker module 200 executing in a data-plane components. As noted abovein relation to FIG. 12 , example methods to compute and install filters204 are described in relation to FIG. 2A, FIG. 2B, FIG. 2C, FIG. 3 ,FIG. 4 , FIG. 5A, FIG. 5B, FIG. 6 , FIG. 7 , and FIG. 8 , and theprotocol state transition and/or resource state transition trackingmodule 200 may be installed with filters and is executed in a data planeresource/component such as TCAM, ACL, DPI engine, etc. To this end, theprotocol state transition and/or resource state transition trackermodule 200 can track control plane traffic during period 1306 while theoperating system executing on the host CPU is booting and unavailable.

FIG. 13 shows a secondary processing unit (referenced as “M3” in FIG. 13) being programmed with action instructions/sequences to perform updatesto the data plane resource when the host CPU and corresponding operatingsystem are booting. In FIG. 13 , as noted above, the host CPU 105 isshown computing and installing filters in a protocol state transitionand/or resource state transition tracker module 200 executing in adata-plane component at 1202. Then, the protocol state transition and/orresource state transition tracker module 200 is shown enabled at 1304(shown as “SM Tracking On” 1304). At the same time, or contemporaneoustherewith, the secondary processing unit is enabled 1304. After suchtime, any protocol state transitions or resource hardware transitions istracked (shown as “DP Tracks SM Updates” 1306 a) by the protocol statetransition and/or resource state transition tracker module 200 andcorresponding updates to a data plane resource (e.g., 210) is performedby the secondary processing unit (shown as “Check DP for SMTransitions—Update Data-Plane” 1306 b).

To this end, the protocol state transition and/or resource statetransition tracker module 200 executing in the data-plane components cantrack the entirety of when the host CPU and its associated control planeoperation is not operational while the secondary processing unit (M3)can service such tracked transition. In embodiments in which adata-plane component can not track the state transition (e.g., certainhardware resource state transitions), an embedded micro-controller orlogic circuit may be used to implement the protocol state transitionand/or resource state transition tracker module as well as the protocoland/or hardware resource state updater (e.g., 208 c).

In FIG. 12 , state transitions received through a protocol packet areshown being identified by the protocol state transition and/or resourcestate transition tracker module 200 executing in the data plane while,FIG. 13 , hardware resource transitions that are not received via aprotocol packet is updated by the secondary processing unit (M3). In theexample shown in FIG. 13 , the secondary processing unit is shown resetat 1308 after the forwarding application is initialized at “forwardingengine driver initialization completes” 1120.

Indeed, other mechanism to perform the update may be used when the hostCPU is available or unavailable. For example, state transition updatesthat were identified by the protocol state transition and/or resourcestate transition tracker module 200 can be updated by the host CPU onceit's fully restored or partially restored (e.g., per FIG. 12 ).Additionally, in some embodiments, protocol or hardware resource statetransition updates that were identified by the protocol state transitionand/or resource state transition tracker module 200 can be updated bythe secondary processing unit.

In addition, FIG. 13 presents the protocol state transition and/orresource state transition tracking module 200 in the context of fastupgrade operation. Indeed, similar operations may be used to loadsharing and load balancing as discussed herein. In combination withcache and flush operation which can restore data plane resource from agenerated shadow copy, the time that the data-plane and control-planeare fully restored is indeed substantially reduced as compared to theunavailable time of the control plane as shown in FIG. 11 . In someembodiments, the second processing unit may provide “alive” response,e.g., to any control plane inquiry, as a proxy to the applicationexecuting on the host CPU 105.

Example Protocol State and Hardware Resource State Transition Filters

FIG. 9 show an exemplary protocol state transition filter 204 (shown as900) configured to execute on the protocol state transition and/orresource state transition tracking module 200 as well as correspondingaction instructions/sequences 206 (shown as 901) in accordance with anillustrative embodiment. The action instructions/sequences 901 may bepre-computed along with the protocol state transition filter 900 to beexecuted in a data plane component. In other embodiments, the actioninstructions/sequences 901 may be performed by a second thread in thehost CPU or main thread in the host CPU (e.g., as described in relationto FIG. 12 ), in a secondary processing unit (e.g., as described inrelation to FIG. 13 ).

FIG. 10 show an exemplary hardware resource state transition filter 204(shown as 1000) configured to execute on the protocol state transitionand/or resource state transition tracking module 200 as well ascorresponding action instructions/sequences 206 (shown as 1001) inaccordance with an illustrative embodiment.

Indeed, the mechanism as described herein may be generalized to be usedin any switching devices in which protocol state tracking and/orhardware resource tracking is desired during headless operation of thedata-plane. In some embodiments, the protocol state tracking (e.g., inthe context of FSU) may include LACP, CDP/LLDP (neighbor change) andRSTP.

LACP Filter/Rule Example. As shown in FIG. 9 , the protocol statetransition and/or resource state transition tracking module 200 may beconfigured with a rule 204 (shown as 900) that scans for when anether-channel is down. The filter 900 may include instructions 902 forthe protocol state transition and/or resource state transition trackingmodule 200 to scan the header of a given LACP protocol data unit (DPU)message for the actor-state field (902) as well as the n-tuple used inthe system to identify a given interface (904) and the LACP control PDU(906) within the LACP PDU.

When a peer node brings down an ether-channel link, the LACP protocol isexpected to send out a PDU with an ‘actor state’ field (partner's portstate) indicating that the link is going down. Actor-state field in LACPPDU is 8-bits wide. In FIG. 9 , the protocol state transition and/orresource state transition tracking module 200, in executing a filter900, may apply a mask of “0x3c”, e.g., via an “AND” operator, to theactor-state field as shown in 902, and if none of the bits in the mask(e.g., bits 2, 3, 4, and 5) are set, then the resulting operationindicates that the corresponding ether-channel is down. The protocolstate transition and/or resource state transition tracking module 200may store a hit flag or hit counter associated with a matched filter ina table or database. As shown in FIG. 9 , the protocol state transitionand/or resource state transition tracking module 200 is configured tostore the hit flag or hit counter at an address shown as addresses 906 aand 906 b in “Events Flag”. Indeed, the address specified under the“Events-flag” refers to a location in the table or database, e.g., in adata-plane resource, which tracks the corresponding hit-counter/hit-flagfor a given filter 204 (e.g., 900).

The corresponding action instruction/sequence 206, as shown in FIG. 9 ,includes a mask (e.g., 908 a and 908 b) to apply to an address (e.g.,906 a and 906 b) in the table or database (e.g., 212). As shown in FIG.9 , the updater (e.g., 208 c) applies the mask (e.g., 908 a, 908 b) to avalue read from an address of the event flags. In the example shown inFIG. 9 , a protocol state transition associated with port “1” is shownas a hit-flag bit 0 (“0x00001”) at “address 1” and a protocol statetransition associated with port “3” is shown as a hit-flag bit 0(“0x00001”) at “address 3”. To this end, when the updater apply a maskof “0x0001” to a value read from “Events-Flag:address1”, the resultindicates whether a filter matched any incoming control packets. If thehit-counter corresponding to the above rule is non-zero, then theupdater (e.g., 208 c) may update the appropriate data-plane resource toindicate that an interface is marked down. As shown in FIG. 9 , when theport “1” is indicated to be down, the updater (e.g., 208 c) may mark theethernet-channel as down by writing an associated <value> to an<address> as shown in 910 a. And, as shown in FIG. 9 , when the port “2”is indicated to be shown, the updater (e.g., 208 c) may mark all membersof the local ethernet-channel as being down and may pre-calculate a hashto redistribute traffic on other member links based on a set of activelinks as shown in 910 b. Such actions (e.g., 910 a, 910 b) may preventforwarding of traffic on a stale adjacency and helps speed up theconvergence.

RSTP Filter/Rule Example. As shown in FIG. 10 , the protocol statetransition and/or resource state transition tracking module 200 may beconfigured with a rule 204 (shown as 1000) that scans for when a port isbrought down (that is, the port is disabled). The filter 1000 mayinclude instructions 1002 for the protocol state transition and/orresource state transition tracking module 200 to scan a received RapidSpanning Tree Protocol (RSTP) message for a change in a given spanningtree, e.g., as provided via a Topology Change Notification (TCN) messageas well as the n-tuple used in the system to identify a given interface(1004) and the message type fields in the BPDU (e.g., TCN-BPDU) and thecontrol flag fields (1006) within the RSTP TCN message.

In FIG. 10 , the protocol state transition and/or resource statetransition tracking module 200, in executing a filter 1000, maydetermine the message type field in BPDU is a “TCN_BPDU” and that theflags field in BPDU are “TC_FLAG” identify a RSTP TCN message. To thisend, if TCN updates have been received on a given port, the updater(e.g., 208 a, 208 c) executing the corresponding actioninstructions/sequence 206 (e.g., 1001) can update the data planeresource to indicate that a port is blocked. As shown in FIG. 10 , theprotocol state transition and/or resource state transition trackingmodule 200 may store a hit flag or hit counter associated with a matchedfilter in a table or database as described in relation to FIG. 9 . Asshown in FIG. 10 , the protocol state transition and/or resource statetransition tracking module 200 is configured to store the hit flag orhit counter at an address shown as addresses 1006 a, 1006 b, 1006 c in“Hit-Counter”. Indeed, the address specified under the “Hit-Counter”refers to a location in the table or database, e.g., in a data-planeresource, which tracks the corresponding hit-counter/hit-flag for agiven filter 204 (e.g., 1000).

The corresponding action instruction/sequence 206, as shown in FIG. 10 ,includes a set of masks (e.g., 1008 a, 1008 b, 1008 c) to apply torespective addresses (e.g., 1006 a, 1006 b, 1006 c) in the table ordatabase (e.g., 212). As shown in FIG. 10 , the updater (e.g., 208 c)applies the mask (e.g., 1008 a, 1008 b, 1008 c) to a value read from anaddress of the “Hit Counter”. In the example shown in FIG. 10 , aprotocol state transition associated with port “1” (1005 a) is shown asa hit-flag bit 2 “0x00010” (1008 a) at “address 1” (1006 a); a protocolstate transition associated with port “2” (1005 b) is shown as ahit-flag bit 2 “0x00010” (1008 b) at “address 2” (1006 b); a protocolstate transition associated with port “3” (1005 c) is shown as ahit-flag bit 2 “0x00010” (1008 c) at “address 3” (1006 c). To this end,when the updater apply a mask of “0x00010” to a value read from“Hit-Counter:address1”, the result indicates whether a filter matchedany incoming STP TCN message. If the hit-counter corresponding to theabove rule is non-zero, then the updater (e.g., 208 c) may update theappropriate data-plane resource to indicate that a port is blocked. Asshown in FIG. 10 , when the ports “1”, “2”, and “3” are indicated to beblocked, the updater (e.g., 208 c) may mark the respective port asblocked by writing a respective associated <value> to a respective<address> as shown in 1010 a, 1010 b, and 1010 c, respectively. Suchactions (e.g., 1010 a, 1010 b, 1010 c) may prevent forwarding of trafficto blocked ports that could create network issues such as loops or blackholes.

Host CPU Load Balancing Application Using Example Protocol StateTransition and/or Resource State Transition Tracker

Referring back to FIG. 3 , the examplary network device 100 (e.g., 300)may be alternatively, or additionally, configured with the protocolstate transition and/or resource state transition tracker module 200 ofFIG. 2A, 2B, or 2C to perform load balancing operations with the hostCPU in accordance with an illustrative embodiment. As used herein, “loadbalancing” refers to the protocol state transition and/or resource statetransition tracker module 200 performing filtering and/or updating ofprotocol state transitions or resource state transitions when the hostCPU is overloaded.

During load balancing operations, the control plane may be periodicallyunavailable. In some embodiments, e.g., when monitored availability ofthe host CPU is determined to be below a specific threshold (say, 25%available load), a load sharing application 308 may direct the host CPU105 to compute a set of filters 204 that are then installed on aprotocol state transition and/or resource state transition trackermodule 200 (e.g., 200 a-200 e) to offload such monitoring of certainprotocol state transitions and/or resource state transitions from thehost CPU.

In some embodiments, the network device may be a non-redundant,standalone fixed or modular switching systems. In other embodiment, thenetwork device may be routers or other networking systems. An example ofnon-redundant, standalone fixed switching system is shown in FIG. 1B. Anexample of modular switching systems is shown in FIG. 1A.

Referring still to FIG. 3 (and FIG. 5A or 5B), when the load balancingoperation is initiated, the control-plane determines (step 502) a set offiltering rules and installs (step 504) filtering rules in the protocolstate transition and/or resource state transition tracker module 200 d,e.g., in the data-plane. In some embodiments, the filtering rules maybederived, or received, from a network interface that provides forcommunication with a remote controller 306 (shown as “OpenFlowController” 306 a). Corresponding action instructions/sequence 206 maybe calculated concurrently with the filters 204 or may be calculated asneeded.

The filters 204 (e.g., 204 a-204 d), in some embodiments, provide forthe matching of state-transition messages for a set of protocols (e.g.,those described herein). In some embodiments, for example, when aprotocol's state update is communicated through a set of fields in thatprotocols' messages, the protocol state transition and/or resource statetransition tracker module 200 d is configured to look (step 506) for,e.g., scan, specific values in the fields and flags when a match isidentified. In addition to specific protocol messages, the protocolstate transition and/or resource state transition tracker module 200 dmay also track other events that may impact the forwarding topology(e.g., link down). The filtering logic may be implemented through avariety of hardware blocks (e.g., ACL TCAM). Any required resources arereserved at system startup.

Once the rules are configured in the protocol state transition and/orresource state transition tracker module 200 d (e.g., in thedata-plane), events that could impact the forwarding topology areflagged while the system (data-plane) in an overloaded state. Thetracked information is then used to perform the necessary updates (step508) to the data-plane (such as shutting down an adjacency, blocking aport etc.) to minimize the negative impact on the network.

As shown in FIG. 3 , the protocol state transition and/or resource statetransition tracker module 200 d is shown coupled to a data planeinterface 304 (e.g., bus interconnect 132) that interfaces to the hostCPU 105 executing a load balancing operation 308. The host CPU 105 alsoexecutes control-plane operations that manages and maintains a pluralityof data-plane-associated tables (e.g., L2 MAC table; MAC learningtables; L3 tables; RIB, FIB, etc.), shown in FIG. 3 as resources 210a-210 d, of the network device 100 (e.g., 100 a-100 e). The loadbalancing operations 308 provide instructions to the host CPU 105 topre-computer filters 204 to install on the protocol state transitionand/or resource state transition tracker module 200 d (shown as “Filter1” 204 a, “Filter 2” 204 b, “Filter 3” 204 c, and “Filter n” 204 d). Thefilters 204 a-204 d executing on the protocol state transition and/orresource state transition tracker module 200 d facilitate the trackingof protocol state transitions and/or resource state transitions when thehost CPU 105 is overloaded (e.g., having a load level over a definedlimit). The protocol state transitions and/or resource state transitionsmay be implemented in data plane components or in secondary processingunit as discussed herein. A updater (e.g., 208 c) may be implemented insecondary thread in the host CPU (e.g., as describe in relation to FIG.11 ) or in secondary processing unit (e.g., as describe in relation toFIG. 12 ).

FIG. 4 shows an exemplary network device 100 (shown as 400) configuredto perform updates to data plane resources during load balancingoperation, e.g., as described in relation to FIG. 3 , in accordance withan illustrative embodiment. Examples of secondary thread and secondaryprocessing units that may execute the updater (e.g., 208 c) is describedin relation to FIGS. 12 and 13 .

FIG. 8 shows an exemplary timing diagram 800 of a method of executingload balancing operations in a network device configured with anexemplary protocol state transition and/or resource state transitiontracker module 200 (shown as “Transition State Tracker” 802 a) inaccordance with an illustrative embodiment. FIG. 8 shows similaroperations (e.g., 606, 614 a, 612, 618, among others), as described inrelation to FIG. 6 .

However, rather than the host CPU 105 becoming unavailable, in FIG. 8 ,the transition state tracker 200 (shown as 802 a) is configured tooperate in parallel with the host CPU 105 when it is overloaded. Indeed,once the load balancing operation (shown as “state tracking application”308 a) is initialized (804), the host CPU 105 pre-computes (806) thefilters 204 to install (808) on the transition state tracker 802 a.Processing of the control plane messages associated with the filter andmonitoring (810) is performed by the transition state tracker 802 ainstead of the host CPU 105 a.

Host CPU Load Sharing Application Using Example Protocol StateTransition and/or Resource State Transition Tracker

Referring back to FIG. 3 , the examplary network device 300 may bealternatively, or additionally, configured with the protocol statetransition and/or resource state transition tracker module 200 of FIG.2A, 2B, or 2C to perform load sharing operations with the host CPU inaccordance with an illustrative embodiment. As used herein, “loadsharing” refers to the protocol state transition and/or resource statetransition tracker module 200 performing filtering and/or updating ofprotocol state transitions or resource state transitions in parallel tocontrol plane operations performed by the host CPU irrespective of thehost CPU's availability or loading state. Indeed, the function ofupdating certain protocol state transition updates and/or certainhardware resource state transition updates has been off-loaded to theprotocol state transition and/or resource state transition trackermodule 200 entirely.

Referring still to FIG. 3 (and FIG. 5A or 5B), when the load sharingoperation (shown as “Host CPU Load Sharing Application” 310) isinitiated, the control-plane determines (step 502) a set of filteringrules and installs (step 504) filtering rules in the protocol statetransition and/or resource state transition tracker module (e.g., 200d), e.g., in the data-plane. In some embodiments, the filtering rulesmaybe derived, or received, from a network interface that provides forcommunication with a remote controller 306 (shown as “OpenFlowController” 306 a).

The filters 204 (e.g., 204 a-204 d), in some embodiments, provide forthe matching of state-transition messages for a set of protocols (e.g.,those described herein). In some embodiments, for example, when aprotocol's state update is communicated through a set of fields in thatprotocols' messages, the protocol state transition and/or resource statetransition tracker module 200 d is configured to look (step 506) for,e.g., scan, specific values in the fields and flags when a match isidentified. In addition to specific protocol messages, the protocolstate transition and/or resource state transition tracker module (e.g.,200 d) may also track other events that may impact the forwardingtopology (e.g., link down). The filtering logic may be implementedthrough a variety of hardware blocks (e.g., ACL TCAM). Any requiredresources are reserved at system startup.

Once the rules are configured in the protocol state transition and/orresource state transition tracker module 200 d (e.g., in thedata-plane), events associated with such rules that could impact theforwarding topology are flagged independently of the host CPU 105 by theprotocol state transition and/or resource state transition trackermodule (e.g., 200 d). The tracked information is then used to performthe necessary updates (step 508) to the data-plane (such as shuttingdown an adjacency, blocking a port etc.) to minimize the negative impacton the network as described herein. FIG. 4 shows an exemplary networkdevice 100 (shown as 400) configured to perform updates to data planeresources during load sharing operation, e.g., as described in relationto FIG. 3 , in accordance with an illustrative embodiment. Examples ofsecondary thread and secondary processing units that may execute theupdater (e.g., 208 c) is described in relation to FIGS. 12 and 13 .

FIG. 8 shows an example timing diagram 800 of a method of executing loadsharing operations in a network device configured with an exemplaryprotocol state transition and/or resource state transition trackermodule (e.g., 200). Indeed, once the load sharing operation (also shownas “state tracking application” 308 a) is initialized (804), the hostCPU 105 pre-computes (806) the filters 204 to install (808) on thetransition state tracker 802 a. The host CPU 105 a can then ignorescontrol plane associated messages associated with the filter and suchmonitoring (810) is performed by the transition state tracker 802.

It should be understood that the various techniques and modulesdescribed herein, including the protocol state transition and/orresource state transition tracker module (e.g., 200) and/or protocolstate updater (e.g., 208), may be implemented in connection withhardware components or software components or, where appropriate, with acombination of both. Illustrative types of hardware components that canbe used include Field-programmable Gate Arrays (FPGAs),Application-specific Integrated Circuits (ASICs), Application-specificStandard Products (ASSPs), System-on-a-chip systems (SOCs), ComplexProgrammable Logic Devices (CPLDs), etc. The methods and apparatus ofthe presently disclosed subject matter, or certain aspects or portionsthereof, may take the form of program code (i.e., instructions) embodiedin tangible media, such as floppy diskettes, CD-ROMs, hard drives, orany other machine-readable storage medium where, when the program codeis loaded into and executed by a machine, such as a computer, themachine becomes an apparatus for practicing the presently disclosedsubject matter.

Embodiments of the network device (e.g., 100) may be implemented, inwhole or in part, in virtualized network hardware in addition tophysical hardware.

Although exemplary implementations may refer to utilizing aspects of thepresently disclosed subject matter in the context of one or morestand-alone computer systems, the subject matter is not so limited, butrather may be implemented in connection with any computing environment,such as a network or distributed computing environment. Still further,aspects of the presently disclosed subject matter may be implemented inor across a plurality of processing chips or devices, and storage maysimilarly be effected across a plurality of devices. While variousembodiments of the present disclosure have been described above, itshould be understood that they have been presented by way of exampleonly, and not limitation. It will be apparent to persons skilled in therelevant art that various changes in form and detail can be made thereinwithout departing from the spirit and scope of the present disclosure.Thus, the breadth and scope of the present disclosure should not belimited by any of the above-described exemplary embodiments but shouldbe defined only in accordance with the following claims and theirequivalents.

What is claimed is:
 1. A method to perform a fast upgrade or rebootoperation comprising: pre-computing or receiving, by a host CPU of anetwork device, one or more filters each configured to monitor for aprotocol or resource state transition in a received packet during anunavailable state of the host CPU; transmitting, by the host CPU, over abus interconnect, to a processor unit or logic circuit of the data-planeof the network device, the filter; tracking, by the processor unit orlogic circuit, a given protocol state and/or resource state transitionsof the control-plane in the received packet using the filter executingon the processor unit or logic circuit when the host CPU is in anunavailable state; and updating, by the host CPU, adata-plane-associated tables of a switch-fabric of the network devicebased on the tracking when the host CPU is in an available state.
 2. Themethod of claim 1, wherein the host CPU is configured, by instructions,to perform control-plane operations that manage and maintain theplurality of data-plane-associated tables.
 3. The method of claim 1,wherein the processor unit or logic circuit is configured to: identify areceived LACP PDU indicating a down-channel link of a peer networkdevice; and update the data-plane that a link aggregation channelassociated with peer network device is down.
 4. The method of claim 1,wherein the processor unit or logic circuit is configured to: monitorfor (i) a specified protocol state transition in at least one of areceived control packet and (ii) a specific resource state transition;and update the data-plane with pre-computed data-plane entries whenspecified protocol state transition or the specific resource statetransition is detected.
 5. The method of claim 1, wherein the host CPUis configured to pre-compute the filter prior to the host CPU enteringinto the unavailable or overloaded state.
 6. The method of claim 1,wherein the host CPU is configured to receive the filter over a networkinterface.
 7. The method of claim 1, wherein the filter is configured toidentify a LACP PDU indicating a protocol state or resource state changeof the logical channel, or one or more links within the channel.
 8. Themethod of claim 1, wherein the filter is configured to identify a BPDUindicating a Spanning Tree Protocol topology-change notification (TCN)message.
 9. The method of claim 1, wherein the filter is configured toidentify a hardware resource transition change.
 10. The method of claim1, wherein the processor unit or logic circuit is implemented in apacket classification engine, a packet-inspection engine, deep-packetinspection engine, an embedded micro-controller in data-plane, and/orACL TCAMs located within a component of the data-plane.
 11. The methodof claim 1, wherein the processor unit or logic circuit is implementedin a device external to the data plane device.
 12. The method of claim1, wherein the tracked protocol state and/or resources are used by thehost CPU to update the data-plane of a detected protocol state and/orresources when the host CPU transitions from the unavailable oroverloaded state to an available state.
 13. The method of claim 1,further comprising: updating the host CPU when the host CPU is in theunavailable state.
 14. A system to perform a fast upgrade or rebootoperation, the system comprising: a processor unit or logic circuit of adata-plane of the system, the processor unit or logic circuitconfigured, with instructions, to: receive a plurality of filterscomputed by a host CPU local to the system each of the plurality offilters configured to monitor for a protocol or resource statetransition in a received packet during an unavailable state of the hostCPU; and track, via the plurality of filters, protocol state and/orresource state transitions of the control-plane when the host CPU is inan unavailable state; and output the tracked protocol state and/orresource state transitions to update a plurality of data-planeassociated tables based on the tracking when the host CPU is in anavailable state.
 15. The system of claim 14, wherein the host CPU isconfigured, by instructions, to perform control-plane operations thatmanage and maintain the plurality of data-plane-associated tables. 16.The system of claim 14, wherein the processor unit or logic circuit isconfigured, by the plurality of filters, to: identify a received LACPPDU indicating a down-channel link of a peer network device; and updatethe data-plane that a link aggregation channel associated with peernetwork device is down.
 17. The system of claim 14, wherein theprocessor unit or logic circuit is configured, by the plurality offilters, to: monitor for (i) a specified protocol state transition in atleast one of a received control packet and (ii) a specific resourcestate transition; and update the data-plane with pre-computed data-planeentries when specified protocol state transition or the specificresource state transition is detected.
 18. The system of claim 14,wherein the filter is configured to identify at least one of: a LACP PDUindicating a protocol state or resource state change of the logicalchannel, or one or more links within the channel; a BPDU indicating aSpanning Tree Protocol topology-change notification (TCN) message; and ahardware resource transition change.
 19. The system of claim 14, whereinthe processor unit or logic circuit is implemented in a packetclassification engine, a packet-inspection engine, deep-packetinspection engine, an embedded micro-controller in data-plane, and/orACL TCAMs located within a component of the data-plane.